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Abstract 

Network coding theory studies the transmission of information in networks whose vertices 
may perform nontrivial encoding and decoding operations on data as it passes through the 
network. The main approach to deciding the feasibihty of network coding problems aims to 
T— H reduce the problem to optimization over a polytope of "entropic vectors" subject to constraints 

imposed by the network structure. In the case of directed acyclic graphs, these constraints are 
completely understood, but for general graphs the problem of enumerating them remains open: 
^ it is not known how to classify the constraints implied by a property that we call serializability, 

which refers to the absence of paradoxical circular dependencies in a network code. 

In this work we initiate the first systematic study of the constraints imposed on a network 
00 code by serializability. We find that serializability cannot be detected solely by evaluating the 

Shannon entropy of edge sets in the graph, but nevertheless, we give a polynomial-time algorithm 
that decides the serializability of a network code. We define a certificate of non-serializability, 
I— H called an information vortex, that plays a role in the theory of serializability comparable to the 

C/J role of fractional cuts in multicommodity flow theory, including a type of min-max relation. 

, Finally, we study the serializability deficit of a network code, defined as the minimum number 

of extra bits that must be sent in order to make it serializable. For linear codes, we show that it 
^-H is NP-hard to approximate this parameter within a constant factor, and we demonstrate some 

^ surprising facts about the behavior of this parameter under parallel composition of codes. 
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1 Introduction 



Network coding theory studies the transmission of information in networks whose vertices may 
perform nontrivial encoding and decoding operations on data as it passes through the network. 
More specifically, a network code consists of a network with specified sender and receiver edges and 
coding functions on each edge. The classic definition of a network code requires that each vertex 
can compute the message on every outgoing edge from the messages received on its incoming edges, 
and that each receiver is sent the message it requires. In directed acyclic graphs, a network code 
that satisfies these requirements specifies a valid communication protocol. However, in graphs 
with cycles this need not be the case; the definition does not preclude the possibility of cyclic 
dependencies among coding functions. Therefore, in graphs with cycles we also require that a 
network code is serializable, meaning it correctly summarizes a communication protocol in which 
symbols are transmitted on edges over time, and each symbol transmitted by a vertex is computed 
without knowledge of information it will receive in the future. The present paper is devoted to the 
study of characterizing the constraint of serializability. 

Motivation. The central question in the area of network coding is to determine the amount by 
which coding can increase the rate of information flow as compared to transferring information 
without coding. Crucial to answering this question is developing tools to find upper bounds for the 
network coding rate. The question of serializability must be considered in order to determine tight 
upper bounds on network codes in cyclic graphs. Determining tight upper bounds is especially 
relevant to one of the most important open problems in network coding, the undirected k-pairs 
conjecture, which states that in undirected graphs with k sender- receiver pairs, coding cannot 
increase the maximum rate of information flow; that is, the network coding rate is the same as 
the multicommodity flow rate. Apart from its intrinsic interest, the conjecture also has important 
complexity-theoretic implications: for example, if true, it implies an affirmative answer to a 20- 
year-old conjecture regarding the I/O complexity of matrix transposition [1]. 

Almost all efforts to produce upper bounds on the network coding rate have focused on the 
following construction. We regard each edge of the network as defining a random variable on 
a probability space and then associate each set of edges with the Shannon entropy of the joint 
distribution of their random variables. This gives us a vector of non-negative numbers, one for each 
edge set, called the entropic vector of the network code. The closure of the set of entropic vectors of 
network codes forms a convex set, and network coding problems can be expressed as optimization 
problems over this set [TJj. In much previous work, tight upper bounds have been constructed by 
combining the constraints that define this convex set. However, this technique is limited because 
there is no known description of all these constraints. 

There are two types of constraints: the purely information-theoretic ones (i.e., those that hold 
universally for all n-tuples of random variables, regardless of their interpretation as coding functions 
on edges of a network) and the constraints derived from the combinatorial structure of the network. 
The former type of constraints include the so-called Shannon and non-Shannon inequalities, and 
are currently a topic of intense investigation O HI [131 [121 HZ] • The latter type of constraints — 
namely, those determined by the network structure — are trivial to characterize in the case of 
directed acyclic graphs: if one imposes a constraint that the entropy of each node's incoming edge 
set equals the entropy of all of its incoming and outgoing edges, then these constraints together 
with the purely information-theoretic ones imply all other constraints resulting from the network 
structure [16]. However, in graphs with cycles there are additional constraints determined from the 
network structure. 

In a series of work on finding network coding upper bounds, large classes of information inequal- 
ities in graphs with cycles were discovered independently by Jain et al. Kramer and Savari [TO] . 
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and Harvey et al. |6]. These go by the names crypto inequality, PdE hound, and informational 
dominance hound, respectively. In various forms, all of them describe a situation in which the 
information on one set of edges completely determines the information on another set of edges. A 
more general necessary condition for serializability was presented in a recent paper by Harvey et 
al. [7]; we will henceforth refer to this information inequality as the Chicken and Egg inequality; 



see Theorem 3.1 Though in all the previous work the inequalities used were sufficient to prove 
the needed bounds on the specific graphs analyzed in the paper, no one has asked if this set of 
inequalities provides a complete characterization of serializability. This inspires the following nat- 
ural questions: Are the information inequalities given in previous work sufficient to characterize 
serializability? Does there exist a set of information theoretic inequalities that gives a sufficient 
condition for serializability? Is there a finite set of information theoretic inequalities implied by 
serializability? Is there any "nice" condition that is necessary and sufficient serializability? We see 
these questions not only as interesting for a general understanding of network coding, but also a 
key ingredient to eventually developing algorithms and upper bounds for network coding in general 
graphs. 



Our contributions. Our work is the first systematic study of criteria for serializability of net- 
work codes. We find that serializability cannot be detected solely from the entropic vector of the 
network code; a counter-example is given in Section |3j This leads us to focus the paper on two in- 
dependent, but dual, questions: Is there any efficiently verifiable necessary and sufficient condition 
for serializability? What is the complete set of entropy inequalities implied by serializability? 

We answer the first question in the affirmative in Section |4] by providing an algorithm to decide 
whether a code is serializable. The running time of this algorithm is polynomial in the cardinalities 
of the edge alphabets, and it is polynomial in their dimensions in the case of linear network codes. 
We answer the second question for the 2-cycle in Section [3j giving four inequalities derived from 
the network structure, and showing that any entropic vector satisfying those inequalities as well as 
Shannon's inequalities can be realized by a serializable network code. (Though structurally simple, 
the 2-cycle graph has been an important source of inspiration for information inequalities in prior 
work, including the crypto inequality f8], the informational dominance bound [6], and the Chicken 
and Egg inequality [^.) Disappointingly, we do not know if this result extends beyond the 2-cycle. 

Beyond providing an algorithm for deciding if a network code is serializable, our work provides 
important insights into the property of serializability. In Section |4] we define a certificate that we 
call an information vortex that is a necessary and sufficient condition for non-serializability. For 
linear network codes, an information vortex consists of linear subspaces of the dual of the message 
space. For general network codes, it consists of Boolean subalgebras of the power set of the message 
set. We prove a number of theorems about information vortices that suggest their role in the theory 
of network coding may be similar to the role of fractional cuts in network flow theory. In particular, 
we prove a type of min-max relation between serializable codes and information vortices: under a 
suitable definition of serializahle restriction it holds that every network code has a unique maximal 
serializable restriction, a unique minimal information vortex, and these two objects coincide. 

Finally, motivated by examples in which non-serializable codes, whose coding functions have 
dimension n, can be serialized by adding a single bit, we consider the idea of a network code being 
"close" to serializable. We formalize this by studying a parameter we call the serializahility deficit 
of a network code, defined as the minimum number of extra bits that must be sent in order to 
make it serializable. For linear codes, we show that it is NP-hard to approximate this parameter 
within a constant factor. We also demonstrate, perhaps surprisingly, that the serialization deficit 
may behave subadditively under parallel composition: when executing two independent copies of 
a network code, the serialization deficit may scale up by a factor less than two. In fact, for every 
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6 > there is a network code $ and a positive integer n such that the seriahzation deficit of $ 
grows by a factor less than 6n when executing n independent copies of Despite these examples, 
we are able to prove that for any non-serializable linear code ^> there is a constant c<i> such that the 
serialization deficit of n independent executions of $ is at least c$n. The concept of an information 
vortex is crucial to our results on serializability deficit. 



Related work. For a general introduction to network coding we refer the reader to [lllllSj . There 
is a standard definition of network codes in directed acyclic graphs (Definition 2.1 below) but in 
many papers on graphs with cycles the definition is either not explicit (e.g. [2j) or is restricted 
to special classes of codes (e.g. [S]). Precise and general definitions of network codes in graphs 
with cycles appear in [9l [H E] and the equivalence of these definitions (modulo some differing 
assumptions about nodes' memory) is proven in ^Llj. The definition for serializability that we set 
forth in Section [2] was used, but never formally defined, in [7]. In its essence it is the same as the 
"graph over time" definition given in [TT] but requires less cumbersome notation. 



2 Definitions 



We define a network code to operate on a directed multigraph we call a sourced graph, denoted 
G = {V, E, S)^S is a set of special edges, called sources or source edges, that have a head but no 
tail. We denote a source with head s by an ordered pair (•, s). Elements of EL) S are called edges 
and elements of E are called ordinary edges. For a vertex v, we let \n(v) = {{u, v) € E} be the set 
of edges whose head is v. For an edge e = (n, v) £ E, we also use ln(e) = ln(u) to denote the set of 
incoming edges to e. 

A network code in a sourced graph specifies a protocol for communicating symbols on error-free 
channels corresponding to the graph's ordinary edges, given the tuple of messages that originate at 
the source edges. 

Definition 2.1 A network code is specified by a 4-tuple $ = {G, {Se}ee-Eu5; {fe}e£Eus) where 
G = {V, E, S) is a sourced graph, 9?T is a set whose elements are called message-tuples, and for all 
edges e G U 5, Se is a set called the alphabet of e and /e : 9?l — >• Se is a function called the coding 
function of e. If e is an edge and ei, . . . ,efc are the elements of ln(e) then the value of the coding 
function /e must be completely determined by the values of /e^ , . . . , fe,. ■ In other words, there must 
exist a function ge : nj=i ~^ such that for all m £ Tl, fe{rn) = ge{fei{rn), . . . , fe^{m)). 

In graphs with cycles a code can have cyclic dependencies so Definition |2.1| does not suffice 
to characterize the notion of a valid network code. We must impose a further constraint that we 
call serializability, which requires that the network code summarizes a complete execution of a 
communication protocol in which every bit transmitted by a vertex depends only on bits that it 
has already received. 

Below we define serializability formally using a definition implicit in [7J. 

Definition 2.2 A network code $ is serializable if for all e G -E there exists a set of alphabets 
^{i...fc) ^ 5](2)^ _ _ ^ ^ Qf functions /i^-''^ = • • . , /i*'^} such that 

1. fP : 9Jt ^ 

2. V mi,m2 e M, if fe{mi) = fe{m2), then \/i, fe\mi) = /i*^(m2), 

^In prior work it is customary for the underlying network to also have a special set of receiving edges. Specifying 
a special set of receivers is irrelevant in our work, so we omit them for convenience, but everything we do can be 
easily extended to include receivers. 
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3. V mi, 1^2 G 97t, if fe{mi) / /e(m2), then 3i, fe\mi) / fe\m2), and 

4. V m G 9Jt, e G -E, J G {1 . . . A:} there is some function h^J^ such that 

\eeln(e) 

We cah such a /i^'"'^^ a serialization of <I>. 

The function /e describes the information sent on edge e at time step i. Item 2 requires that 
together the functions f'^"^^ send no more information than /e and Item 3 requires that f^^"^^ 
sends at least as much information as /e- Item 4 requires that we can compute fe given the 
information sent on ah of e's incoming edges at previous time steps. 

In working with network codes, we will occasionally want to compare two network codes <I>, 
such that "transmits all the information that is transmitted by <I>." In this case, we say that 
is an extension of <I>, and <I> is a restriction of <!>'. 

Definition 2.3 Suppose that $ = (G, {Se}, {/e}) and = (G, {S'J, {/^}) are two net- 
work codes with the same sourced graph G and the same message set 9Jl. We say that $ is a 
restriction of <!>', and is an extension of <I>, if it is the case that for every m G 9Jt and e £ E, 
the value of fe{m) completely determines the value of /e(m); in other words, fe = ge° f'e for some 
function : S'e — )• Sg. 

The entropic vector of a network code gives a non-negative value for each subset of a network 
code. The value of an edge set F is the Shannon entropy of the joint distribution of the random 
variables associated with each element of F, as is formalized in the following definition. 

Definition 2.4 Given a network code $ = (G, 9H, {T,^}, {/g}), G = {V, E, S), the entropic vector 
of <I> has coordinates H{F) defined for each edge set F = {ei, . . . , Cj} E U S by: 

H{F) = H{eie2...ej) = ^ -p{xi,X2, ■ ■ ■ ,Xj)log{p{xi,X2, . . . ,Xj)), 

where the probabilities are computed assuming a uniform distribution over 5Jt. 




3 Serializability and Entropy Inequalities 

Constraints imposed on the entropic vector alone suffice to characterize serializability for DAGs, 
but, the addition of one cycle causes the eutopic vector to be an insufficient characterization. We 
show that the entropic vector is not enough to determine serializability even on the 2-cycle by 
giving a serializable and non-serializable code with the same entropic vector. 

The two codes illustrated in Figure [T] apply to the message tuple {X, Y, Z) , where X, Y, Z are 
uniformly distributed random variable over F2. It is easy to check that the entropy of every subset 
of corresponding source and edge functions is the same, and thus the codes have the same entropic 
vector. The code in Figure |l(a) is clearly serializable: at time step one we send X on {u, v) and 
Y on {v,u); then Y on {u,v) and X on {v,u). On the other hand, the code in Figure [l(b) is not 
serializable because, informally, to send X + Y on the top edge requires that we already sent X + Y 



^Throughout this paper, when the operator Yl is apphed to functions rather then sets we mean it to denote the 
operation of forming an ordered tuple from an indexed hst of elements. 
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(X,Y) (X+Y,Z) 




(X,Y) (X+Y,Z) 
(a) Serializable (b) Not Serializable 

Figure 1: Two network codes with the same entropy function. 



on the bottom edge, and vice versa. A formal proof that the code in Figure 2(b) is not seriaUzable 
can be obtained by applying the characterization of serializability in Theorem |4.6[ 

In order to use entropy inequalities to give tight upper bounds on network coding rates, we 
need an enumeration of the complete set of entropy inequalities implied by serializability, i.e. a 
list of necessary and sufficient conditions for a vector V to be the entropic vector of a serializable 
code. (Note that it need not be the case that every code whose entropic vector is V must be 
serializable.) For the 2-cycle we can enumerate the complete set of inequalities. In particular, we 
give four inequalities that must hold for any serializable code on the 2-cycle: two are a result of 
downstreamness which is a condition that must hold for all graphs (it says that the entropy of the 
incoming edges of a vertex must be at least as much as the entropy of the incoming and outgoing 
edges together), the third is the Chicken and Egg inequality due to [i7J, and the fourth is a new 
inequality that we call the greedy inequality. It is equivalent to being able to complete the first 
iteration of our greedy algorithm in Section |4j We show that these four inequalities together with 
Shannon's inequalities are the only inequalities implied by serializability, in the following sense: 

Theorem 3.1 Given a rational-valued entropic vector, V, of a 2-cycle on nodes u, v, with source 
X into node n, source y into node v, and edges a = {u,v) and b = (f,n), there exists a se- 
rializable code that realizes cV, for some constant c, if and only if V satisfies Shannon's in- 
equalities, downstreamness {H{abx) = H{bx), H{aby) = H{ay)), the Chicken and Egg inequal- 
ity {H{ab) > H{abx) - H{x) + H{aby) - H{y)), and the greedy inequality {H{a) + H{b) > 
H{ax) - H{x) + H{by) - H{y) when H{a) + H{b) / 0). 

Multiplication of the vector by a constant c is a natural relaxation because the theorem becomes 
oblivious to the base of the logarithm we use to compute the Shannon entropy. 

The proof of Theorem |3.1| involves considering four cases corresponding to the relationship 
between i?(a) and H (ax) , H {ay) , H (x) , H (y) and between (6) and H{bx), H (by) H{x), H{y). Each 
case requires a distinctly different coding function to realize the entropic vector. All the coding 
functions are relatively simple, involving only sending uncoded bits, and the XOR of two bits. Most 
of the work is limiting the values of coordinates of the entropic vector based on the inequalities 



that the entropic vector satisfies. The proof of Theorem |3.1| is provided in Appendix [Aj 

The big open question left from this work is whether we can find a complete set of constraints 
on the entropic vector implied by the serializability of codes on arbitrary graphs. We currently do 
not know of any procedure for producing such a list of inequalities. Even if we had a conjecture 
for such a list, showing that it is complete is likely to be quite hard. If we have more than three 
sources, just determining the possible dependencies between sources is difficult because they are 
subject to non-Shannon information inequalities. 



5 



4 A Characterization of Serializability 



4.1 Linear Codes 

A characterization of serializability for linear network codes is simpler than the general case because 
it relies on more standard algebraic tools. Accordingly, we treat this case first before moving on to 
the general case. Throughout this section we use V* to denote the dual of a vector space over a 
field F, and /* to denote the adjoint of a linear transformation /. For an edge e with alphabet Sg 
and coding function /g : 9Jt — )• Eg, we use Te to denote the linear subspace C 

Though it is impossible to characterize the serializability of a network code in terms of its 
entropic vector, computationally there is a straightforward solution. In polynomial time we can 
either determine a serialization for a code or show that no serialization exists using the obvious 
algorithm: try to serialize the code by "sending new information when possible." When we can no 
longer send any new information along any edge we terminate. If we have sent all the information 
required along each edge, then the greedy algorithm finds a serialization; otherwise, we show that no 
serialization exists by presenting a succinct certificate of non-serializability. Though our algorithm 
is straightforward, we believe that the change in mindset from characterizing codes in terms of the 



entropic vector is an important one, and that our certificate of non-serializability (see Definition 4.1) 
furnishes an effective tool for addressing other questions about serializability, as we shall see in later 
sections. 

Given a network code $ = (G, 9Jt, {Sg}, {/g}), with coding functions over the field F, our 



greedy algorithm (pseudocode, LinSerialize(<I>) , is given in Appendix B.l), constructs a set of 



edge functions /i^"'^^ and alphabets Sg^"'^'* for each edge. These objects are constructed iteratively, 

(i) (i) fh 

defining the edge alphabets Sg and coding functions /g in the i iteration. Throughout this 
process, we maintain a pair of linear subspaces ^g, ^ for each edge e = (u, f) of G. 
is the linear sparj^ of all the messages transmitted on e so far, and Bf> is intersection of Tg with 
the linear span of all the messages transmitted to u so far. (In other words. Be spans all the 
messages that could currently be sent on e without receiving any additional messages at u.) In 
the z*^ iteration, if there exists an edge e' such that B^,' contains a dual vector Xg/ that does not 
belong to Ag/, then we create coding function fe'^ for all e. The coding function of f^^) is set to 

be Xg' and its alphabet is set to be F. For all other edges we set f^ ' = 0. This process continues 
until Be = Ae for every e. At that point, we report that the code is serializable if and only if 
Ae = Tg for all e. At the end, the algorithm returns the functions f^^"^^ and the alphabets 
where k is the number of iterations of the algorithm, as well as the subspaces {^g}. If the code 
was not serializable, then {A^} is interpreted as a certificate of non-serializability (a "non-trivial 
information vortex") as explained below. 

LinSerialize($) runs in time polynomial in the size of the coding functions of ^. In every 
iteration of the while loop we increase the dimension of some A^ by one. A^ is initialized with 
dimension zero and can have dimension at most dim(Tg). Therefore, the algorithm goes through 
at most Xlee-B dim(rg) iterations of the while loop. Additionally, each iteration of the while loop, 
aside from constant time assignments, computes only intersections and spans of vector spaces, all 
of which can be done in polynomial time. 

To prove the algorithm's correctness, we define the following certificate of non-serializability. 

Definition 4.1 An information vortex (IV) of a network code consists of a linear subspace We ^ 
dJl* for each edge e, such that: 

1. For a source edge s, Wg = Tg. 

^If {Vi : i G 1} is a collection of linear subspaces of a vector space V , their linear span is the minimal linear 
subspace containing the union Uigi ^i- We denote the linear span by +igiVi. 
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2. For every other edge e, We = n (+e'ein(e)W^e') • 
An information vortex is nontrivial if We 7^ for some edge e. 

We think of We as the information that we can send over e given that its incoming edges, 
e' S In{e), can send We'- In our analysis of the greedy algorithm, we show that the messages 
the greedy algorithm succeeds in sending (i.e., the linear subspaces {^e}) form an IV and it is 
non-trivial if and only if the code isn't serializable. 

The following theorem shows the relationship between IVs, serialization, and the greedy algo- 



rithm. The proof can be found in Appendix B.l 



Theorem 4.2 For a network code <I> = (G, 9K, {Eg}, {/g}), the following are equivalent: 

1. $ is not serializable 

2. LinSerialize($) returns {Ae} s.t. 3e,Aej^ Tg 

3. $ has a non-trivial information vortex 

In Section [5] and Section [6] we will see that information vortices provide a concise way for proving 
the non-serializability of a network code. Moreover, the notion of an information vortex was critical 
to our discovery of the result in Section |6| 



4.2 General codes 

Our characterization theorem extends to the case of general network codes, provided that we 
generalize the greedy algorithm and the definition of information vortex appropriately. The message 
space is no longer a vector space, so instead of defining information vortices using the vector 
space Wl* of all linear functions on OJI, we use the Boolean algebra 2^ of all binary- valued functions 
on We begin by recalling some notions from the theory of Boolean algebras. 

Definition 4.3 Let 5 be a set. The Boolean algebra 2'^ is the algebra consisting of all {0, 1}- 
valued functions on S, under and (A), OR (V), and not (-■). If / : S" — t- T is a function, then the 
Boolean algebra generated by /, denoted by (/), is the subalgebra of 2"^ consisting of all functions 
6 o /, where 6 is a {0, 1}- valued function on T. If Ai,A2 are subalgebras of a Boolean algebra A, 
their intersection Ai n ^2 is a subalgebra as well. Their union is not, but it generates a subalgebra 
that we will denote by Ai + A2■ 
l^ S is a finite set and ^ C 2'^ is a Boolean subalgebra, then there is an equivalence relation on 
S defined by setting rc ~ y if and only if b{x) = b{y) for all b £ A. The equivalence classes of this 
relation are called the atoms of A, and we denote the set of atoms by At(^). There is a canonical 
function : 5" — )• At{A) that maps each element to its equivalence class. Note that A = (/a)- 

The relevance of Boolean subalgebras to network coding is as follows. A subalgebra A C 2^ is a 
set of binary- valued functions, and can be interpreted as describing the complete state of knowledge 
of a party that knows the value of each of these functions but no others. In particular, if a sender 
knows the value of f{m) for some function f : ^ ^ T, then the binary-valued messages this sender 
can transmit given its current state of knowledge correspond precisely to the elements of (/). This 
observation supplies the raw materials for our definition of the greedy algorithm for general network 
codes, which we denote by GenSerialize(<I>). 

As before, the edge alphabets and coding functions are constructed iteratively, with T,^^ and 
fe^ defined in the i*'^ iteration of the main loop. Throughout this process, we maintain a pair 
of Boolean subalgebras Ae, Be Q for each edge e = {u,v) of G. Ae is generated by all the 
messages transmitted on e so far, and Be is intersection of (/e) with the subalgebra generated by 
all messages transmitted to u so far. (In other words. Be spans all the binary-valued messages that 
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could currently be sent on e without receiving any additional messages at u.) In the i^^ iteration, 
if there exists an edge e' such that B^,/ contains a binary function x^i A^,^, then we create a 
binary-valued coding function fe^ for all e, which is set to be Xg' if e = e' and the constant function 
if e 7^ e' . This process continues until Be = Ae for every e. At that point, we report that the 
code is serializable if and only if Ag = (/e) for all e. At the end, the algorithm returns the functions 
fe^"^^ and the alphabets Tj^"^\ where k is the number of iterations of the algorithm, as well as the 
subspaces {^e}- The pseudocode for this algorithm GenSerialize(<I>) is presented in Appendix [Cj 
If $ has finite alphabets, then GenSerialize(<I>) must terminate because the total number of 
atoms in all the Boolean algebras A^ (e G E) is strictly increasing in each iteration of the main 
loop, so YleeE l^el Upper bound on the total number of loop iterations. In implementing the 
algorithm, each of the Boolean algebras can be represented as a partition of 9Jt into atoms, and all of 
the operations the algorithm performs on Boolean algebras can be implemented in polynomial time 
in this representation. Thus, the running time of GenSerialize(<I>) is polynomial in X^gg^ l^e|- In 
light of the algorithm's termination condition, the following definition is natural. 

Definition 4.4 If G = {V, E, S) is a sourced graph, a generalized information vortex ( GIV) in a 
network code $ = (G, 971, {Eg}, {/g}) is an assignment of Boolean subalgebras A^ C 2®^ to every 
e G EU S, satisfying: 

1. As = ifs) for ah se S; 

2. Ae = (+gein(„)^e) H (/g) for ah e = (u, v) G E. 

A GIV is nontrivial if Ae ^ (/g) for some e £ E. A tuple of Boolean subalgebras T = {Ae)e£Eus is 
a semi-vortex if it satisfies ([T]) but only satisfies one-sided containment in Q, i.e., 

3. Ae C n ife) for ah e = {u,v) G E. 

If r = (Ae) and T = {A'^) are semi-vortices, we say that T is contained in T if Ag C A'^ for all e. 



In Appendix [C] we prove a series of statements (Lemmas C.2 C.5) showing that: 

• Semi- vortices are in one-to-one correspondence with restrictions of The correspondence 
maps a semi-vortex {Ae)eeEus to the network code with edge alphabets At{Ae) and coding 



functions given by the canonical maps 9JT — )• At{Ae) defined in Definition 4.3 



• There is a set of semi-vortices corresponding to serializable restrictions of $ under this cor- 
respondence. They can be thought of as representing partial serializations of ^. 

• There is a set of semi-vortices corresponding to GIV's of <I>. These can be thought of as 
certificates of infeasibility for serializing 

• GenSerialize($) computes a semi-vortex F which is both a GIV and a partial serialization. 

These lemmas combine to yield a "min-max theorem" showing that the every network code has 
a maximal serializable restriction that coincides with its minimal GIV, as well as an analogue of 
Theorem 4.2; proofs of both theorems are in Appendix [C| 

Theorem 4.5 In the ordering of semi- vortices by containment, the ones corresponding to partial 
serializations have a maximal element and the GIV's have a minimal element. These maximal and 
minimal elements coincide, and they are both equal to the semi- vortex T = {Ae)eeEus computed 
by GenSeriaIize(<I>). 

Theorem 4.6 For a network code <I> with finite alphabets, the following are equivalent. 

1. $ is serializable. 

2. GenSerialize(<^) outputs {Ae}eeE s.t. Ve, ^dg = (/g). 

3. $ has no nontrivial GIV. 
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5 The Serializability Deficit of Linear Network Codes 



The min-max relationship between seriaHzable restrictions and information vortices (Theorem 4.5) 
is reminiscent of classical results like the max-flow min-cut theorem. However, there is an important 
difference: one can use the minimum cut in a network to detect how far a network flow problem 
is from feasibility, i.e. the minimum amount by which edge capacities would need to increase in 
order to make the problem feasible. In this section, we will see that determining how far a network 
code is from serializability is more subtle: two network codes can be quite similar-looking, with 
similar-looking minimal information vortices, yet one of them can be serialized by sending only one 
extra bit while the other requires many more bits to be sent. 

We begin with an example to illustrate this point. The codes in Figure [2] apply to the message 
tuple {Xi, . . . , Xn, Yi, . . . ,Yn) where Xi, Y-i are independent, uniformly distributed random variables 



over F2. The codes in Figures 2(a)| and 2(b) are almost identical; the only difference is that the 
code in Figure 2(a) has one extra bit along the top edge. The code in Figure 2(a) is serializable: 
transmit Xi along {u, v), then Xi + Yi on edge {v,u), then X2 + Yi on {u,v), ... ,Xn + Yn on (v, u), 
and finally Xi + 1^ on (u, v). On the other hand, the code in Figure [2(b) is not serializable, which 
can be seen by applying our greedy algorithm. 



(Xi,X2+Yi,X3+Y2,....X,+Y„) 



(X2+Yi,X3+Y2,...,Xi+Y„) 



(X„X^ 




(Y„Y2,...,Y) 



(X,+Yi,X2+Y2,...,X„+Y„) 
(a) Serializable 




(X,+Yi,X2+Y2,...,X„+Y„) 
(b) Not Serializable 



Figure 2: Two almost identical network codes. 



Thus, the code in Figure 2(b) is very close to serializable because we can consider an extension 
of the code in which we add one bilj^to the edge {u,v) to obtain the code in Figure 2(a) that is 
serializable. On the other hand, there are similar codes that are very far from being serializable. If 
we consider the code with the same sources and f(^u,v) = f{v,u) = IlILi + its edge alphabets 
have the same size and its minimal information vortex is identical, yet any serializable extension 
requires adding n bits. To completely characterize serializability we would like to be able to separate 
codes that are close to serializable from those that are far. This motivates the following definition. 

Definition 5.1 For a network code <I> = (G, Tl, {Sg}, {/e}) and an extension = {G' {S^}, {/g}). 



the gap of defined by j{<^') = Eeei?log2 l^e 



log2 |Se| 



represents the combined number of 



extra bits transmitted on all edges in ^ as compared to The serializability deficit of $, denoted 
by SD($), is defined to be the minimum of 7(<I*') over all serializable extensions ^' of The linear 
serializability deficit of a linear code <I>, denoted LSD(<I>), is the minimum of 7(<I*') over all linear 
serializable extensions 

Unfortunately, determining the serialization deficit is much more difficult than simply deter- 
mining serializability. 

Theorem 5.2 Given a linear network code it is NP-hard to approximate the size of the minimal 
linear serializable extension of <I>. Moreover, there is a linear network code ^ and a positive integer 
n such that LSD($")/(nLSD(<I>)) < 0{^^). 



*In this section, for simplicity, we refer to one scalar-valued linear function on an F-vector space as a "bit" even 
if |F| > 2. 
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Both statements in the theorem foUow directly from the following lemma. 
Lemma 5.3 Given a hitting set instance {N, S) with universe A^, |A^| = n, and subsets S C 2^, 



an optimal integral solution fc, and an optimal fractional solution .., - 

polynomial time we can construct a linear network code such that LSD(<I>) 

Proof sketch. 



, with V" ^ = 2,in 
k, but LSD($9) < p. 




Figure 3: The reduction from Hitting Set 



The full proof of the Lemma is in Appendix B.l 
Here, due to space limitations, we merely sketch 
the main ideas. The graph used in the reduc- 
tion is illustrated pictorially in Figure [3j Given 
a hitting set instance (A^, S) we create a net- 
work code with one source for each i G N 
(source message denoted by Yi) and a super- 
source s (source message denoted {Xi , X2 , . . . , X„ 
The symbols Xi,Yi don't refer to bits, but ac- 
tually to blocks of rii bits, where rii is the num- 
ber of sets in S containing i; each of the bits 
in Xi or Yi corresponds to one of the sets that 
i belongs to. For each i £ N we use a gadget 
consisting of a 2-cycle on vertices Ui,Wi, with Yi 
feeding into Wi and Xi feeding from the super- 
source s into Ui. The edges between Ui and Wi 



are a copy of the gadget in Figure [2j We exploit the fact that sending one extra bit in this gadget 
allows the information vortex in the gadget to "unravel", leading to transmission of all the bits 
encoded on the edges of the 2-cycle. The 2-cycle {ui,Wi) participates in a larger 4- vertex gadget 
{ui,Wi,pi,Vi} corresponding to the element i. The role of Vi is to participate in "set gadgets", where 
the gadget corresponding to a set A consists of a bidirected clique on all the vertices {vj\j G A}. 
The role of pi is less important; it plays a necessary part in disseminating bits to leftover parts 
of the network after the "important" parts have been serialized. If there is a hitting set of size k 
then we send one bit on each of the 2-cycles {ui,Wi) corresponding to elements i in the hitting set. 
This "unlocks" the bits that were locked up in those 2-cycles, which allows a sufficient amount of 
information to flow into the set gadgets that they become serialized. The vertices pj are then used 
for disseminating the remaining bits to the unused 2-cycles {uj,Wj) where j did not belong to the 
hitting set. 

To prove, conversely, that a serializability deficit of at most k implies that there is a hitting set 
of size k, we make use of the fact that the network code constructed by our reduction has a large 
number of information vortices, one for each pair consisting of an element of A^ and a set S that it 
belongs to. If C is the set of all i such that an extra bit is transmitted somewhere in the 4-vertex 
gadget for i, and C fails to contain an element of some set A, then this in turn implies that one 
of the aforementioned information vortices remains an information vortex in the extension of the 
code. Thus, C must be a hitting set. 

The more difficult step in proving Lemma 5.3 lies in showing that fractional solutions of the 
hitting set problem can be transformed into efficient serializable extensions of For this, we 
make use of the fact that the edge alphabets in can be regarded as Ee^IF^. and their duals can 
be regarded as S* (F*)'^. If |F| is large enough, then the uniform matroid Uq^p is representable as 
a set {ti, . . . ,tp} of p vectors in (F*)'^. For each of the "fractional elements" Zi/q in the fractional 
hitting set, we send Zi bits of the form x^t in the extension of where t is one of the elements of 
our matroid representation of Uq^p in (F*)'' and x is the bit that we would have sent for element i 
in the hitting set reduction described two paragraphs earlier. The fact that ^ • Zi = p implies that 
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every element of the matroid representation is used exactly once in this construction. The fact that 
we have a fractional set cover implies that in each set gadget, we receive extra bits corresponding 
to q distinct elements of S. Since these elements are a basis for (F*)'', it is then possible to show 
that they combine to allow a serialization of all the "missing bits" in that gadget, and from there 
we finish serializing the entire network code as before. 



6 Asymptotic Serializability 

The results in the previous section showed both that there are non-serializable codes with large 
edge alphabets that become serializable by adding only one bit (example in Figure |2]) and that 
the serialization deficit can behave sub-additively when we take the n-fold cartesian product of ^ 



(Theorem 5.2). This prompts the investigation of whether there exists a code that isn't serializable, 
but the n-fold parallel repetition of the code can be serialized by extending it by only a constant 
number of bits, and thus it is essentially indistinguishable from serializable. We formalize this idea 
with the following definition. 

Definition 6.1 A network code ^ is asymptotically serializable if lim„_j.oo LSD($")/LSD($) = 
where is n-fold cartesian product of ^ with cartesian product define in the obvious way. 

If one is using a network code to transmit infinite streams of data by chopping each stream 
up into a sequence of finite blocks and applying the specified coding functions to each block, then 
an asymptotically serializable network code is almost as good as a serializable one, since it can be 
serialized by adding a side channel of arbitrarily small bit-rate to each edge of the network. 

Despite indications to the contrary in Section [5j we show that any non-serializable linear code 
is not asymptotically serializable via the following theorem. 

Theorem 6.2 For a linear network code <i> = (G, 9Jt, {Sg}, {/g}) over a field F, then LSD(^>") > cn 
where c is a constant dependent on <I>. 

The proof of the theorem considers the alphabets of the n-fold product of $ as elements of 
a tensor product space. Using this machinery, we show that information vortices in the graph 
are preserved if we don't increase the amount of information we send down some edge by order n 
bits. More specifically, if {Wg} is a non-trivial information vortex in <I>, and e is an edge such that 
dim(We) < dim(re) = m, then if we add some edge function / to every edge in the graph, the 
information vortex remains non-trivial as long as the dimension of / is less than mn. A complete 
proof is provided in Appendix [Ej 
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A Proof of Theorem 3.1 



Theorem A.l (Theorem 3.1 restated) Given a rational-valued entropic vector, V, of a 2-cycle 
on nodes u,v, with source x into node u, source y into node v, and edges a = {u,v) and b = {v,u), 
there exists a serializable code that realizes cV, for some constant c, if and only if V satisfies 
Shannon's inequalities, downstreamness {H{abx) = II{hx), II{aby) = H{ay)), the Chicken and Egg 
inequality {H{ab) > H{abx) — H{x) + H{aby) — II{y)), and the Greedy inequality {II{a) + II{b) > 
H{ax) - H{x) + H{by) - H{y) when H{a) + H{b) / 0). 

Throughout this proof it will often be convenient to refer to the conditional entropy of two sets 
of edges. 

Definition A. 2 For two subsets of edges F = {ei, 62, ■ ■ . , Cj} and F' = {e'l, e'^, . . . , e'^,} , the 
conditional entropy of F given F' , denoted H{F\F') = H{eie2 ■ ■ . ej\e'^e'2 . ■ . e'^) = H{FF')-H{F'). 
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We first show that all the inequalities are necessary. Shannon's inequalities must hold for the 
entropic vector of any set of random variables. Downstreamness (term coined by [Ilj) was shown 
to be necessary even for DAGS by Yeung and Zhang |l6j. Harvey et al. [7j showed that the Chicken 
and Egg inequality is necessary. Thus, it remains to show that our greedy inequality is a necessary 
condition for serializability. 

Lemma A.3 The inequality H{a) + H{h) > H{ax) - H{x) + H{by) - H{y) = H{a\x) + H{h\y) 
holds for any serializable code on a 2-cycle when H{a) + H{h) > 0. 

Proof. Suppose there is a serializable code such that H{a) + H{h) < H{a\x) + H{h\y) and H{a) + 
H[b) > 0. Because conditioning reduces entropy, H{a) > H{a\x), and likewise H{b) > H{b\y). 
These three inequalities together imply that H{a) = H(a\x) and H(b) = H{b\y). It follows from 
the definition of serializability and H{a) + H{b) > that there exists a non-zero fa^ or fj^^\ Let 
i* be the smallest such i and let fa be the associated non-zero coding function (the choice of a 

is without loss of generality). We can rewrite H{a\x) as H{fa \x) + H{a\fa '^x). H{fa = 

(«*) 

because i* is the smallest such i implies that fa is computed soley from x. But, this gives us that 
H{a) = H{a\fPx), which is a contradiction to fa '^ non-zero. □ 



To prove the other direction of Theorem |3.1| we will use a case based analysis, but first we make 
a few observations to bound the cases we need to consider. 

Observation A. 4 The following ten values completely determine the entropic vector of the 2- 
cycle: I{x; y),H{x\y), H {y\x) , H {a\x) , H {b\x) , H {a\y) , H{b\y),H{a), H(b),H{ab). 

Proof. Due to downstreamness and Shannon's inequalities the following equations hold: H{xy) = 
I{x-y) + H{x\y) + H{y\x),H{axy) = H{bxy) = H{abxy) = H{xy),H{y) = I{x;y)+H{y\x), H{x) = 
I{x; y) + H{x\y), H{ax) = H{a\x) + H{x), H{aby) = H{ay) = H{a\y) + H{y), H{abx) = H{bx) = 
H{b\x) + H{x), H{by) = H{b\y) + H{y). This implies that the value of all 15 non-zero elements of 
the entropic vector are determined by the 10. □ 

Observation A. 5 H{b\x) > H{a\x) 

Proof. H{bx) = H{abx) > H{ax) by downstreamness and then monotonicity. □ 
Observation A. 6 H{a\y) > H(b\y) 

Proof. Parallel to proof of observation |A.5| □ 

Observation A. 7 max{H{a\x), H{a\y)) < H{a) < H{a\x) + H{a\y) + I(x; y) 

Proof. H{a) < H{a\x) + H{a\y) + I(x;y): Apply submodularity on ax and ay to get H{ax) + 
H{ay) > H{axy) + H(a) = H{xy) + H{a), then subtract H{x) + H{y) from both sides. 
H{a) > maL,x(H{a\x), H{a\y)) : H(a) > H{a\x) and H{a) > H{a\y) because conditioning reduces 
entropy. □ 

Observation A. 8 max{H {b\x) , H {b\y)) < H{b) < H{b\x) + H{b\y) + I{x; y) 

Proof. Parallel to proof of observation |A.7| □ 

Observation A. 9 H{b\x) + H{a\y) < H{ab) < H{b\x) + H{a\y) + I{x; y) 
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Proof. H{ab) > H{h\x) + H{a\y) by the chicken and egg inequahty. 
H{ab) < H{b\x) + H{a\y) + I{x; y): by submodularity on ay and bx: 
H{ay) + H{bx) = H{aby) + H{abx) > H{abxy) + H{ab). 

^H{ab)<H{bx) + H{ay)-H{xy). □ 



Now, we come to our case analysis for proving the forward direction of Theorem 3.1 

We first multiply our entropic vector by the least common denominator so that all the elements 

of the vector are integer. We show that we can find a code that realizes this integer valued entropic 

vector. 

Let Xi, ...Zj(^^.y-^ be random variables originating at source x, and let Yi, ...,Yfj(^y^^^, 

Zi, Zj^j-.y-j be random variables originating at source y, where Xi,Yj, Zj. are independent for all 
i,j,k. 

We split up the proof into 4 cases. Case 1 corresponds to when H(a) is greater than H{a\x) + 
H{a\y) and H{b) is greater than H {b\x) + H {b\y) . Case 4 takes care of the instances when both H{a) 
is less than H{a\x) + H{a\y) and H{b) is less than H{b\x) + H{h\y). Cases 2 and 3 are symmetric 
corresponding to when exactly one of H{a) and H{b) is greater than the sum of the conditional 
entropy on x and y. Cases 1,2 (or 3), 4 correspond to distinctly different coding functions on edges a 
and b. Case 1 has the simplest codes - we send bits uncoded with the exception of possibly XORing 
X and Z or Y and Z. In cases 2 and 3 we need to XOR bits of X, Y on one edge, and in case 4 we 



need to XOR bits of X, Y on both edges in a manner similar to the example in Figure 2(b) 
Case 1: 



H{a) = H{a\x) + H{a\y) + /, / > and note / < I{x; y) by Observation A. 7 
H{b) = H{b\x) + H{h\y) + g, g>0 and note g < I{x; y) by Observation | A. 8[ 



H{ab) = H{b\x) + H{a\y) + h, and note < /i < /(x; y) by Observation A. 9 



Observation A. 10 h < H{a\x) + H{h\y) + f + g 

Proof. Implied by submodularity on a and b. □ 

Observation A. 11 h > max(f,g) 

Proof. 

H{x\a) > H{x\ab) Conditioning reduces entropy 
H{ax) - H{a) > H{abx) - H{ab) 
H{ab) - H{bx) - H{a\y) + H{x) > H{a) - H{ax) - H{a\y) + H{x) 
H{ab) - H{b\x) - H{a\y) > H{a) - H{a\x) - H{a\y) 

h>f 



The proof that h > g is similar. 

□ 

We claim that the following code realizes the entropic vector and is serializable: 
For notational convenience let Z[ = ^2 — -^Z+Si Z[^_^_y = Zh~g. Any Z'^ with i > h — f — g 

we will take to be 0. 

fa = Xi, . . . , Yi + Z[, . . . , Yjji^a\x) + Z'H{a\x)^ Zi, . . . Zf 

fb = Xi + Z'^[a\x)+l-> ■ ■ -^^Hibly) + ^H{a|x)+H(6|j/) ' ^1 ' • ' • ^^H{b\x), ^h-g-l, ■ ■ . , Zh 
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This is a valid code because H{x\y) > H{a\y) > H{b\y), H{y\x) > H{b\x) > H{a\x), h < I{x;y), 
h < H{a\x) + H{h\y) + f + g, and h > max(/, It is easy to check that this code reahzes the 
entropic vector. It is serializable because H{a\y) > H{b\y), H{b\x) > H{a\x) and both sources 
know Z. 



Case 2: 

H{a) = H{a\x) + H{a\y) — f, f >0 and note / < mm{H{a\x), H{a\y)) by Observation A. 7 
H{b) = H{b\x) + H{b\y) +g, g>0 and note g < I{x; y) by Observation | A. 8[ 
H{ab) = H{b\x) + H{a\y) + /i, and note < /i < /(x; y) by Observation A.9 



Observation A. 12 h < {H{a\x) - /) + H{b\y) + g 

We claim that the following code realizes the entropic vector and is serializable: 
Any Zi with i > h we will take to be 0. 



fa = Xi + Yi,X2 + Y2, . . . ,Xf + Yf,Xf^i, . . . + Zg^i, . . . ,Yjj^a\x) + Zg^H{a\x)-f 

fb = Xi + • • • , XH^p\y-) + ^g+_ff(a|a;)-/+_ff(fe|?/)) ^1; • • • ) ) ^1 ) ■ ■ ■ i^g 

This is a valid code for the same reasons as Case 1, and also because h < {H{a\x) — f) + H(b\y) + g, 
and / < H{a\x) and / < H(a\y). It is easy to check that this code realizes the entropic vector; 
here it is important that g < h which is true by the argument from Observation |A.lT| It is serial- 
izable because we can send Yi, ...,Yjj(^^^-^ along edge b, then because H{b\x) > H{b\y) we can send 
everything along edge a, and then because H{a\y) > H{a\x) we can send all the Xs and Zs on edge b. 



Case 3: 

H{a) = H{a\x) + H{a\y) + and note / < I{x; y) by Observation A. 7 

H{b) = H{b\x) + H{b\y) - 5, 5 > and note g < mm{H{b\x) , H {b\y)) by Observation |A^ 
H{ab) = H{b\x) + H{a\y) + /i, and note < /i < /(x; y) by Observation |A.9| 



Symmetric to Case 2. 



Case 4: 

H{a) = H{a\x) + H{a\y) — f, f >0 and note / < nim{H{a\x), H{a\y)) by Observation A. 7 



H{b) = H{b\x) + H{b\y) - g, g>0 and note g < mm{H {b\x) , H {b\y)) by Observation |X8 
H{ab) = H{b\x) + H{a\y) + /i, and note < /i < /(x; y) by Observation |A.9| 



Applying the inequality H{a) + H{b) > H{a\x) + H{b\y), together with the fact that H(a) > 
H{a\x) and H{b) > H{b\x) implies that at least one of H{a) > H{a\x), H{b) > H{b\y) holds. Or, 
written in terms of f,g this means that at least one of / < H(a\y), g < H{b\x) holds. 

Observation A. 13 h < {H{a\x) - /) + iH{b\y) - g) 

Case 4a: / < H{a\y) 

We claim that the following code realizes the entropic vector and is serializable: 
Any Zi with i > h we will take to be 0. 

fa = X2 + Yi,X3 +Y2, . . . , Xf^i + Yf,Xi,Xf+2, ■■■ , ^H(a|j/) > + ^1, • • • , ^//(a|x) + ^//(a|x)-/ 

fb = -^1+^1, -^^2 +^2, • • • , Xg+Yg, Xg+i+ZH(^a\x)-f+l: ' ' • 5 ^ H {b\y) + ^ H {a\x) - f + H {b\y) - Q 1 ^+1> • • • ! ^J?(b|x-) 
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This is a valid code because /+1 < H{a\y), h < {H{a\x)— f)+{H{b\y)—g), f < min(i/(a|x), -^(al?/)) 
and g < m.m{H {b\x) , H {b\y)) . It is easy to check that this code reahzes the entropic vector. To 
show it is seriahzable, we first consider the case when f < g: we can send Xi along edge a; then 
Xi + Yi along edge b; then X2 + Yi along edge a; . . . ; then Xj^i + Yj, Xf^2, • • • > XH{a\y) along 
edge a; then because H{a\y) > H{b\y), we can send then everything along edge b; and then since 
H{b\x) > H{a\x) we can complete the transmission for edge a. The case for f > g is very similar. 

Case 4b: g < H{b\x) 
This case is similar, but we switch the roles of edge a and edge b. 

B Proofs omitted from Section [4] 
B.l Linear codes 



Algorithm 1 Greedy Algorithm for Linear Codes 
LinSerialize(<l>) 

1: /* $ = (G, 9Jt, {Sg}, {/g}), G = {V,E,S) is a network code with coding functions over field F. 

We construct T,^"''^ and fe^"^\ */ 
2: Af. for all e £ E. /* Ag CI Tg represents the information we have sent over edge e * / 
3: As ^ Ts for all s G 5. 

4: i?e 7e n (+sein(e)^s) for all e £ E. /* Bf, CI Tf, represents the information that the tail of e 

knows about T^* / 
5: i = 1 

6: while 3 e = (m, v) in G such that A^ 7^ i?e do 

7: Let Xf, be any vector in that doesn't lie in A^ 

8: si') ^ F, /i') ^ Xe 
9: ^ + (^e) 

10: V e' G ^, e' ^ e, ^ 0, ff ^ 

11: Ve' = (f , •) G E', Be' ^ Tg' n (fig/ + {xe}) /* Node v "learns" Xe */ 

12: i + + 

13: end while 



Theorem B.l (Theorem 4.2 restated) For a network code ^> = (G, 9H, {Sg}, {/g}), the following 
are equivalent: 

1. $ is not seriahzable 

2. LinSerialize($) returns {^e} s.t. 3 e, ylg 7^ T'e 

3. $ has a non-trivial information vortex 

Proof. -|2] =^ -[T] If LinSerialize(<I>) returns {^e} s.t. Ve, = ?e then $ is seriahzable: 

We show that the f^"^\ T!"e"^^ created by LinSerialize(<l') satisfy the conditions in Definition 2.2 

1. /i*) : 9Jt — )• Se*) by construction. 

2. The non-zero functions /i*'' form a basis for Tg. Because linear maps are indifferent to the 
choice of basis, if feimi) = fe{m2) then in any basis, each coordinate of feimi) equals the 
corresponding coordinate of /e(?7i2), and thus fe\mi) = fe\'m2) for all i. 
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3. If fe{mi) ^ fein^2) then for any basis we choose to represent fe, the values /e("i-i), /e("^2) 
will differ in at least one coordinate, and thus 3i, fe\m2) 7^ fe\'m2)- 

4. When we assign a function fe — Xe we have that Xe is in Be which guarantees it is computable 
from information already sent to the tail of e. 

[2] =^ [3] If LinSerialize($) returns {^e} s.t. 3eAey^Te then <I> has a non-trivial IV. 
We claim the the vector spaces {Ae} returned by LinSerialize(<I>) form a non-trivial IV. {Ae} is 
non-trivial by hypothesis, so it remains to show it is an IV. {Ae} satisfies property (1): For each 
S ^ S, As = Ts by construction (Line 3 of LinSerialize(<I>) ). 

{^e} satisfies property (2): By induction on our algorithm, Be is exactly Te n (+e'ein(e)^e') • At 
termination. Be = Ae for all e € E. So, we have that Ae = Ten (+e'ein{e)^e') • 

[3l [1] If <I> has a non-trivial IV then it isn't serializable. 

Suppose for contradiction that $ = {G, Tl, jS pj, {fe}), G = iV, E, S) is serializable. Let f^^''^'' and 



H^"^^ satisfy the conditions of definition 2.2 Let {We} be a non-trivial IV for 



We say that a function /i"'^ has property P if there 3 1711,1112 € 9JT such that fe'{fni) 7^ fe'"{rn2) 
and nil, 1712 S W^. There must be such a function since our IV is non-trivial and fe^"^^ is a 

serialization of <I>. Let i* be the smallest i such that any function satisfies property P and suppose 
/g* satisfies P with messages m\ , 772-2 • 

By definition. We* = Te* n {+e'e\n{e*)We') , so m\,m*2 S W^* implies that for all e' G ln(e*), 
m\,m2 G W^. But, /** can distinguish between m\,m2 so at least one of e' G ln(e*) must also 
be able to distinguish between m*,m2 at a time before i* . Therefore, there exists some i' < i* 
that satisfies property P, a contradiction to the fact that i* was the smallest such i. □ 



C Proofs omitted from Section 4.2 



The following lemma is standard; for completeness, we provide a proof here. 
Lemma C.l Suppose /i : — )• Ti and f2'S^T2 are two functions on a set S. 
1- (/2) ^ (/i) if and only if there exists a function g : Ti ^ T2 such that f2 = g o fi. 

2. Suppose S is finite. If /i x /2 denotes the function 5 — )• Ti x T2 defined by x 1— )• f2{x)), 
then (/i> + (/2) = (/ix/2). 

Proof. A Boolean subalgebra of 2"^ can be equivalently described as a collection of subsets of S, 
closed under union, intersection, and complementation, by equating a {0, l}-valued function h with 
the set In this proof we adopt the "collection of subsets" definition of a Boolean subalgebra 

of 2^^, since it is more convenient. Note that under this interpretation, if / : 5 — )• T is any function 
then (/) consists of all subsets of the form /"^([/), [/ C T. 

If f2 = g o fi for some g, then every set of the form f2^{U) can be expressed as fi'^{g~^{U)) 
which shows that (/2) C {fi). Conversely, if (/2) C [fi) then for every u ^ T2 the set /^^({w}) G (/2) 
belongs to (/i), i.e. it can be expressed as fi^{Vu) for some set K C Ti. The sets /f^K) are 
disjoint as u ranges over the elements of T2 so the sets Vu themselves must be disjoint. Define 
g{v) = if v G Vu for some u G T2, and define g{v) to be an arbitrary element of T2 otherwise. 
For any x G 5, if ti = /2(x) then x G f2^{u) = fi'^{Vu), which implies that g{fi{x)) = u. Hence 
f2 = g°fi as desired. 

To prove ^ we argue as follows. Clearly (/i), (/2) ^ (/^ x 72), so (/i) + (72) C (/^ x /2) as 
well. For the reverse inclusion, note that every element of (/i x 72) can be expressed as a finite 
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union of sets of the form (/i x "^(^1,^2)- Every such set can be expressed as (ti) H (^2), 
which proves that it belongs to (/i) + (/2). □ 



Algorithm 2 Greedy algorithm for general network codes 
GenSerialize(<l>) 

1: /* ^ = {G,m,{^^},{f^}),G = {V,E,S) is a network code. */ 
2: /* We construct Tj^"''^ and fe^"^\ */ 

3: ^ for all e G E. /* Ae Q (/e) represents the information we have sent over edge e * / 

4: As ^ {fs) for ah s £ S. 

5: Be ^ ife) H (+,eln(e)^) for ah e£E. 

6: /* Be ^ (/e) represents the information that the tail of e knows about fe */ 
7: i = l 

8: while 3 e = {u, v) in G such that Ae / Be do 

9: Let Xe be any binary- valued function in Be \Ae. 

10: si'V{0,l}, /i'Vxe 
11: Ae^ Ae + \xe) 

12: V e' G e' / e, ^ {0}, /^^^^ ^ 

13: Ve' = (?;, •) e E, Be' ^ {fe') n {Be' + (xe)) /* iVode V "learns" Xe */ 

14: i ^ i + 1 
15: end while 



Lemma C.2 For a given network code restrictions of $ are in one-to-one correspondence 
with semi- vortices F. The correspondence maps F to the network code $'[F] whose alphabets are 



Ti'e = At{Ae) and whose coding functions are the functions /g = /a^ defined in Definition 4.3 
The inverse correspondence maps to the unique semi-vortex F[$'] satisfying Ar = (/^) for all 
reEUS. 

Proof. Suppose F = (^4^.) is a semi- vortex and $'[F] is defined as stated, with coding functions 
fe = fAe- For all e = {u, v) £ E, let ln(e) = {ei, . . . , e^} and let = /g^. The relation 

Ae = [+tlAe,) n (/e) 

implies that 

Ae^+t,Ae^=+l,{f,) = {{fl,...,fk)), 



where the last equation follows from Lemma C.l[ Since (/aj) = Ae Q {{fi, • • • , fk))-, we can apply 



Lemma C.l again to conclude that /a^ = 5 o (J^, . . . , /^) for some g. Thus $'[F] is a network code. 



To prove that it is a restriction of we use the containment Ae C (/g) for every edge e £ E U S, 



together with Lemma C.l, to construct the functions (7e : Sg — )• Eg required by the definition of a 



restriction of 'I'. □ 

Lemma C.3 If is a restriction of and is serializable, then F[$'] is contained in every 
GIV of ^>. 

(i) 

Proof. Suppose $ is a serializable restriction of with serialization consisting of alphabets Eg 
and coding functions fe^\ 

Suppose now that F = {Ae}e£Eus is any GIV of <I>. First, we claim (/i'^) C Ae for every edge 
e. To prove the claim we use induction on i. The claim is clearly true when i = 0. Otherwise, 
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C.l 



(i) 

the existence of a function hi such that 



let ei,...,er be the edges in ln(e). By Lemma 

fe\m) = he^ (nj=i /ej'*"^) i™plies the first of the following containments: 

(#>^+^=l+£l(if^)C+^=lAe,. (1) 

The second containment in ([T]) follows from our induction hypothesis. Now, property [2] of a serial- 
ization implies that (/i*^) ^ (/e)- Combining this with ([T]) we obtain 

(/») c n (/^) = Ae, (2) 

as desired. 

If r[<I>'] is not contained in F, then there exists an edge e of G such that 

(/:) 2 ^e. (3) 

Property [2] of a serialization implies the existence of a function H : T,'^ ^ IliLi ^e*'' such that 
H{f^{m)) = {f^^\m), fi^Xm)) for all m E OJt. Property ji] implies that H is one-to-one, hence 
it has a left inverse: a function G : si*^ — )• Sg such that G o H is the identity. Letting 

F = rii=i/e*^' definition of H implies that F = H o f'^, whence /g = G o F. Applying 
Lemma | C . 1 1 once more, 

(/^)C(F) = +ii(/»), 

and the right side is contained in vlg by ([2]). This contradicts ([s]), which completes the argument. □ 

Lemma C.4 At the start of any iteration of the main loop of GenSerialize(<I>) the following 
invariants hold. 

L ^e = (/i'\...,/i'-')) for slleeE. 

2. Be = ife) n (+eGln{«)^e) for ah eeE. 

3. The collection of subalgebras F = {Ae}e£Eus constitutes a semi-vortex. 

4. $'[F] is a serializable restriction of <I>. 

Proof. The first three invariants can be verified by a trivial induction on the number of loop itera- 
tions. We claim that <&'[F] is serializable, and in fact that the coding functions {fe^} constructed 
in the preceding iterations of the main loop constitute a serialization of [F] . For property [l] of 
a serialization, there is no thin g to check. To prove property 2J observe that fe'^^ ^ = (/g). 



which implies by Lemma C.l that fe^ = b o f'^ for some binary- valued function b on Sg. If 
/g(mi) = /e(nT-2) then 6(/g(mi)) = 6(/g(m2)), which establishes property [2j To prove property [sj 
observe that Ag = (/g) is generated by the functions /i^"' ^\ so if /e(mi) / fe{i^2) then there is 
some j < i — 1 such that fe^\mi) 7^ fe\iT^2)- Finally, property kl follows from the structure of the 
algorithm itself. Either fe is the constant function 0, in which case there is nothing to prove, or 
fe ^ is equal to the function Xe chosen in line [o] of the j^^ loop iteration of GenSerialize($). In 
that case Xe belonged to the Boolean algebra Bg at the start of that loop iteration, which means 

Xe G (/e) n (+gg|n(u)Ag) C +gg|n(„)^e = +eeln{M) ( + l<^<j(/i^'') 



(i) 

and another application of Lemma C.l implies the existence of the function he required by the 



definition of serialization. □ 
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Lemma C.5 When GenSerialize($) terminates, F = {^e}ee-Bu5 is a GIV. 



Proof. Lemma C.4 ensures that F is a semi-vortex, and the algorithm's termination condition 
ensures that there is no edge e such that 7^ B^.. In hght of the fact that B^. = (/e) n (+e6in(M)^e) , 
this implies that F = {Ae}ee-Bu5 is a GIV. □ 



Theorem C.6 (Restatement of Theorem 4.5) In the ordering of semi-vortices by containment, 
the ones corresponding to partial serializations have a maximal element and the GIV's have a 
minimal element. These maximal and minimal elements coincide, and they are both equal to the 
semi- vortex F = {Ae}eeEus computed by GenSerialize((I>). 



Proof. By Lemmas C.4 and C.5, F is a GIV and = $'[F] is a serializable restriction of <I>. If ^" is 



any other serializable restriction of then Lemma [C. 3 implies that F[<^"] C F because F is a GIV. 



If T is any GIV, then Lemma C.3 implies that T 5 F [<!>'] = F becase is a serializable restriction 



of □ 



Theorem C.7 (Restatement of Theorem 4.6) For a network code $ with finite alphabets, the 
following are equivalent. 

1. $ is serializable. 

2. GenSerialize(<^) outputs {Ae}eeE s.t. Ve, = (/e). 

3. $ has no nontrivial GIV. 



Proof. In the proof of Theorem 4.5 we saw that the subalgebras {^e} at the time GenSerialize(<I>) 
terminates constitute a GIV F such that: 

• F is contained in every other GIV; 

• <I>'[F] is a serializable restriction of <I>; 

• F contains F[<I>"] for every serializable restriction of 

Let ^' = <I>'[F]. We now distinguish two cases. 

Case 1: <!>' is isomorphic to <I>. In this case, we show that all three equivalent conditions 
hold. First, ^ is serializable because is. Second, the fact that is isomorphic to $ means 
that (/e) = (/g) = Ae for every edge e. Finally, we know that every GIV in <I> contains F. But 



F = F[^>'] = F[^>], which is the trivial GIV. By Definition 4.4, any GIV containing the trivial GIV 
is trivial. Hence $ contains no nontrivial GIV. 

Case 2: is not isomorphic to <I>. In this case, is a proper restriction of hence the 
semi- vortex F = F[<I>'] constitutes a nontrivial GIV. Any serializable restriction of $ satisfies 
F[$"] C F C F[<I>]. In particular this means that $ is not a serializable restriction of itself, i.e. ^> is 
not serializable. Finally, the statement that <!>' is not isomorphic to ^ means that for some e G E, 
if'e) / ife)- Recalling that (/g) = Ae, this means that GenSerialize(<I') does not output {Ae}eeE 
such that ye,Ae = (/e). □ 



D Analysis of the hitting set reduction 

Reduction D.l (Hitting Set to Minimum LSD($)) Given a hitting set instance (N, S) with uni- 
verse = {1, and a family 5 C 2^ of subsets of A^, we let S{i) = Aj(2)) ■■•j^i{ni)} ^ S 
where A G S{i) iff i G ^, and the ordering of As is arbitrary. We make the network coding instance 
$ = {G,Tl, {Sg}, {/g}) where G is a directed, sourced graph with vertex set: {s}UVUUUWUP, 
where V = {vi,V2, f™}, and similarly for U, W and P; and source edges: {(•, s)}U{(», 'Wi)\i £ N} 
with messages /(,^^) = UieNllA&s{i) and /(,^^^) = l\A&s{i)^/'- Where ah X,Y are uniform 
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random variables over for some sufficiently large ^ > to be chosen later. Rather than enumer- 
ate our edge set E and the coding functions on each edge, we will just specify the coding functions 
for each edge in E. If a function /e is not specified, then e is not in E. We also show the network 
coding instance pictorially in Figure [sj We use Yl denote the n-fold cartesien product, so HILi -^i 
is synonomous with the ordered n-tuple {Xi,X2, The coding functions are as follows: 

k=l j£N:j^i k=l 

k=2 A£S(i)nS{j) k&A 

= n ^t^'"' ' ^ ^ h^^^v^) = E ^t^'"' ' e ^ 

k=l k=l 

rii 



k=l 



/fe,^.) = X^ > Vz e AT /(^^,,^) = n + , Vz G AT 



fc=l 



Proof of Part 1 of Lemma |5.5t Given a hitting set instance ( A^, S) we create the network code <I> 
using the Reduction |D.1[ 

We show that {N, S) has a hitting set of size k if and only if LSD(<I>) < k. 

(=^) Suppose [N, S) has a hitting set of size k. We show that LSD($) < k. 

Let C be a hitting set of size k. Consider adding bit xf"''^-' for c G C to edge {uc,Wc)- This 

(1 k) (1 k) 

allows us to serialize all bits in the following stages, implicitly we are defining fe and ' for 
ah ee E: 

A 

1. For all c G C, we can serialize all bits on edges {wc,Uc) and {uc,Wc)- Wc learns Xc " , so it 

can send bit Xc + Yc "^^^ to Uc- Now, this allows Xc "^^'^ + Yc "^^^ to be sent on (uc,ifc), 
and we continue in this way until all bits serialized on these two edges. 

2. For all c G C, send f{yj^,v^) = Y\k=i ^^"^''^ on {wc,Vc). 

3. For all i G A^, send /(s^^^) njgTVijyi DfeLi ^f'^''^ on {s,Vi) 

4. For every set A ^ S there is an element c G ^ H C because C is a hitting set. Thus, 
there is a vertex in y, Vc that knows X^. Vc can therefore send bit t{A) = ^a<^A-^a 

Va,a & A,a c. Now every Va knows t{A) and can send it along {va, Va>) for all a' £ A,a' ^ a. 
This serializes all bits on edges between vertices in V. 

5. Now every vertex Vi knows every bit X: it received all but HfcLi xf^'^''^ in step 3, and 
determined JlfcLi ^i^*''"' ™ step 4. So, we can send f(vi,pi) = J2k=i ^f'^'^''^ edge {vi,pi). 

6. At Pi we can add the code X]fcL2 xf^^''^ from {s,pi) to f(vi,p^) to obtain X^^''^' and send it on 
{pi,Wi). 
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7. Now, for all i £ N — C, we can serialize {wi,Ui) and (uj, Wi) as we did in step 1. 
(<^) Suppose that LSD(<I>) < k. We show that {N, S) has a hitting set of size k. 

Consider the partition of E into sets E{i)\/i G N such that E{i) = {{■ ^ -^1) that is 
e E -E'(^) if and only if the tail of e is indexed by i. 

Lemma D.2 For a set A £ S suppose no bits are added to any edge in |Jjg^£'(i), then the bit 
t{A) = Y^ai^A-^a on {va,Va'), ya,a' G A cannot be serialized. 

Lemma |D.2| implies that for every set A £ S, at least one bit must be added on an edge 
in U^g^£'(z) to serialize <I>. In particular, if $' is a minimal serializable extension of <I> and let 
C = sends at least one additional than $ on some edge in E{i)} then C is a hitting set for 

{N,S). And |C| < LSD($) < k. 

Proof of Lemma \D.^ Let V{A) = IJaeA'^a- ^"^y edge e going into any node in V{A) must send /e 
because no bits are added on any of these edges. In any serialization, it must be that for some Va £ 
V{A)^ bit t[A) is sent on (vq, Va') for some a' before Va receives t{A) from any Va" £ y{A). Without 
loss of generality, suppose this vertex isvi^i £ A. Consider the subgraph induced by vertices indexed 
by i. There is an information vortex on this subgraph with Wf^g^^.-^ = = T(^s,Ui)j and 

We = for all other edges in the subgraph. To check this one simply has to verify that for edges e 
out of Vi TePir^g = 0, and similarly for Uj. This implies that this subgraph is not serializable. We 
don't add any bits to the subgraph by hypothesis, so to "destroy" this IV, and serialize the subgraph 
we need W(^y.^y-^ + I^(s,i,j) to have a non-zero intersection with T^^Vi,-)- But this is a contradiction to 
our choice of i. □ 

□ 

Proof of Part 2 of Lemma |5.3t Given a hitting set instance ( A^, S) we create the network code ^ 













(or matroid theory) that the subset T = < 





















using the Reduction D.l If |F| is large enough then one can show using facts from linear algebra 

C F"? for {xi, . . . ,Xp} 

distinct elements in F has the property that any q element subset forms a basis for F*?; in other 
words, T is a realization of the uniform matroid Up^q over F. Partition the p vectors of T into \N\ 
subsets Ti, . . . , T|7v| such that Tj contains Zi vectors, note that X^jg^r — P makes this valid. 

We now consider Here, we will regard the edge alphabet for edge e as a vector in Eg F''. 
The tensor product space allows us to consider q copies of Eg on each edge e without fixing a basis. 
We claim that the extenstion of in which we transmit the extra bits HteTi -^i '^^^ (E)t on edge 
{ui,Wi) for alH G A^ is serializable. 

Observation D.3 Transmitting HteTi "'^i^'^^' ^ along edge {ui,Wi) allows node Wi to learn 
riteT ^t^^'"^ «) t for all A: G {1 . . . n,}. 



Proof. We saw in the proof of the forward direction of the part 1 of Lemma |5.3| that transmitting 
A- along edge {ui,Wi) in $ implies node Wi can learn X- for all A; G {1 . . . rii}. This implies 
that in the g-fold repetition, transmitting xf'''^-' t along edge {ui,Wi) allows node Wi to learn 
xf*'*' (^t for aU A; G {l...ni}. □ 

Observation D.4 If IlteTa -^a ® ^ '^^^ be transmitted along edge {wa, Va) for all a G A then we 
can transmit (t{A) = X^aeA -^a ® edges (fa, fa'); ^.^ ^ ^ ^• 
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Proof. YljeN-jy^a rifcLi X^^'^'^'' (giF"? can be transmitted along (sq, Va) for all a £ A, and that, together 
with HteT -^a ® ^ transmitted along (u7a,fa), allows node Va, for all a G A, to compute a{a) = 

Now, fix i G A. Send a(a) along {va,Vi) for all a G j4,a 7^ z. Each message in the tuple a(a) 
is a linear combination of "^aeA-^a ^ ^'^ ^^'^ hence a legal message on all edges {va,Vi). After 
sending these messages, node Vi knows "^aeA -^a ^ for YlaeA ^« distinct vectors t. Our ZaS form 
a feasible fractional hitting set, thus Xlae^ If — ^^"^ SaeA -^i — ^- ^-element subset of T 
forms a basis of F"?, and so Vi can determine cr{A). We can then send (7{A) on edges (uj, fa), a G A 
and then (fa, fa') for all o' G A, a' 7^ o. □ 



Observation |D . 3 and D.4 together imply that for all sets ^4 G we can send a (A) on the clique 



formed by fa, a G A. A simple argument identitical to the last steps in the forward direction of the 
proof of part 1 of Lemma 5.3 imply that we can serialize the rest of the coding functions. □ 



E Proofs omitted from Section [6] 

The proofs in this section rely on knowledge of tensor products. We include a brief tutorial here 
for convenience. 



E.l Tensor products 

If V,W are any two vector spaces with bases {eY}i£x and {ej^jjgj-, respectively, their tensor 
product is a vector space V <^W with a basis indexed by X x J. The basis vector corresponding 
to an element in the index set will be denoted by ej*^. For any two vectors v = aief 
in V and w = ^j^Y ^^^^^ tensor product is the vector 

f (g) tf = ^ ^ OibjeY (X" 
i j 

inV (g)W. 

Lemma E.l If {xj}jgx and {yj}jej are bases of V, W, respectively, then {xj (g) yj}(ij)^xxj is a 
basis oiV (^W. 

Proof. It suffices to prove the lemma when yj = ej^ for all j G J^. If the lemma holds in this 
case, then by symmetry it also holds when Xj = eY for all i G X, and then the general case of the 
lemma follows by applying these two special cases in succession: first changing the basis of V, then 
changing the basis of W. 

To prove that B = {xj ^Y^ii,j)^ixJ ^ basis oiV® W, it suffices to prove that every vector 
of the form ej^ can be written as a linear combination of elements of B. By the assumption 
that {xijjgj is a basis of V, we know that ef = Y^^^k ^k^k for some finite subset K C I and 
scalars {ak)k&K- Now it follows that = J2k(^K c-k i^k ^ ) ' desired. □ 



E.2 Basis and Rank 

Definition E.2 If 1/ is a vector space with basis B, and W = {VFjjigx is a collection of linear 
subspaces, we say that W is B -compatible if R 5 is a basis of Wi, for all i G X. We say that W 
is basis-compatible if there exists a basis B for V such that W is i?-compatible. 
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Lemma E.3 If 1/ is a vector space and W is a basis-compatible collection of linear subspaces, 
then W can be enlarged to a basis-compatible collection of linear subspaces VV that forms a Boolean 
algebra under + and H. In particular, any three subspaces X,Y, Z £ W satisfy: 

{X nY) + z = {X + z) n {Y + z) 
{X + Y) n z = {X n z) + {Y n z). 

Proof. Simply let VV be the set of all linear subspaces of C y such that W D B is a basis of 
W. □ 

Lemma E.4 If 1/ is a vector space and W is a basis-compatible collection of linear subspaces, 
then W U {V} U {0} is also a basis-compatible collection of linear subspaces. 

Proof. The proof is a trivial consequence of the definition of basis-compatible. □ 

Lemma E.5 If 1^ is a vector space and X, Y are any two linear subspaces, then W = {X, Y} is 
basis-compatible . 

Proof. Let Bxy be any basis of X OY, let Bx be any basis of X containing Bxy, and let By be 
any basis of Y containing Bxy- All the vectors in Bx U By are linearly independent, because if v 
is any vector that can be expressed as a linear combination of elements of By \ Bx and as a linear 
combination of elements of Bx, then v must belong to both X and Y, hence v € X DY . But the 
only element of X OY that can be expressed as a linear combination of elements of By \ Bx is the 
zero vector, because Bxy is disjoint from By\Bx, and these two sets together constitute a basis of 
Y. Hence Bx U By can be extended to a basis B of V, and then W = {X, Y} is i?-compatible. □ 

If A, B are subspaces of vector spaces V, W, respectively, then B is defined to be the linear 
subspace ofV <^W consisting of all linear combinations of vectors in the set {a<^b : a € A, b € B}. 
If V is a collection of linear subspaces of V and W is a collection of linear subspaces of W, then 
V (X" W denotes the collection of all linear subspaces A^ B C V (g) ly such that A ^ V and i? € W. 

Lemma E.6 If V is a basis-compatible collection of linear subspaces of V and W is a basis- 
compatible collection of linear subspaces of W then V (8) W is a basis-compatible collection of linear 
subspaces of V iSiW. 

Proof. If B, B' are bases of V, W, respectively, such that V is ^-compatible and W is i?'-compatible, 
then V (^W is {B X i3')-compatible. □ 

Corollary E.7 If X,Y are subspaces of a vector space V and Z is a subspace of another vector 
space W, then 

[{X (^w) + {Y(^ w)] n{V(^z) = [{X ly) + (F z)] n [{y ® vf) + (y ® z)] . 



Proof. By Lemmas E.4 and E.5, we know that V = {X, y, y} is basis-compatible in V and W 



{Z, W} is basis-compatible in W. Hence V (8) W is basis-compatible in V (^W. The corollary now 



follows by applying Lemma E.3 □ 



Definition E.8 If V, W are any vector spaces, a rank-one element of y (8) is an element that 
can be expressed in the form v'^w for some v € V, w £ W. The rank of an element x G F (8 is the 
minimum value of r such that x can be expressed as a linear combination of r rank-one elements 
of V (SiW. (If X = then its rank is defined to be 0.) 
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Lemma E.9 If V, W are finite-dimensional vector spaces and x E V W then the rank of x is 
bounded above by min{dim(y), dim(VK)}. 

Proof. Let n = dim(y),m = dim(VF). We will assume without loss of generality that n <m and 
prove that the rank of x is at most n. Let {e-^} and {e]^} be bases of V, W, respectively. We may 
express x as a linear combination 

n m 

X = ^^aijeY (^ef. 
i=i j=i 

For I < i < n let Hi = YlY=i ^ij^Y ' "^^^^ 

n 

x = ^eY (g) yi, 
1=1 

and this expresses x as a sum of n rank-one elements, implying that the rank of x is at most n. □ 

Lemma E.IO If V, W are finite-dimensional vector spaces of dimension n, m, and P QV (gW is 
a linear subspace of dimension p, then there exists a pn-dimensional linear subspace Q Q W such 
that P C y Q. 



Proof. Let {xi, . . . , Xp} be a basis of P. By Lemma E.9 we can write each Xk in the form 



n 

Xk = '^Vki^Wki, 

1=1 



for some vectors Ufci(l < k < p, 1 < i < n) m V and Wki{l < k < p, 1 < i < n) in W. The vectors 
{wki} span a subspace of W of dimension at most pn. Taking Q to be any pn-dimensional subspace 
of W containing {wki}, we see that P C V Q as desired. □ 

E.3 Non-serializability 



Theorem E.ll (Theorem 6.2 restated) For a linear network code $ = {G,Tl, {Sg}, {/e}) over a 
field F, then LSD(<I>"') > cn where c is a constant dependent on <I>. 

Proof. If a linear network code has an information vortex {We}eeE then {W^}eeE constitutes an 
information vortex in the product of n copies of the network code. Using the fact that = V<S'¥'^ 
for every vector space V, we may rewrite the information vortex as {We ® F"}. 

Now suppose that T is a linear subspace of fg F" of dimension p. (Think of T as a set of 
bits that are added to $ to get an extention of (j).) Using Lemma E.IO there is a subspace Q C F" 
of dimension pd (where d = dim(9Jl*)) such that T C 9Jt* <g Q. Define a new family of subspaces 

Xe = iWe^¥'') + {Tt*0Q). 

These subspaces constitute an information vortex. To verify this, we must check that 

(We «) F") + (m* Q) = [(Te F") + {m* Q)] n [+e'eln(e) ((W^e' ^ F") + {Tl* Q))] . (4) 

Because {We (X" F"} is an information vortex, we have 

We ^ F" = (Te F") n {+e'eln(e)We' ® F") . 
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Equation Q now follows by applying Corollary E.7 with X = T(,,Y = +e'ein{e)We' , Z = Q. Thus 
the collection of subspaces {^e} constitutes an information vortex in the setting where T added 
to every edge of the network code. If the network code is serializable in this setting, then {X^} 
must be a trivial information vortex, implying that Xe = Te (8) for every edge e. Because 
{Vl^e} is non-trivial, we know there is at least one edge e such that dim(VFe) < dim(Te). Let 
m = dim(Te), k = dim(VFe)- The dimension of We F" is kn. The dimension of ® Q is pd?. 
Hence the dimension of X^, is bounded above by kn + pcP. On the other hand, if the information 
vortex {Xe} is trivial, then Xe = Tg (S" F" implying that dim(Xe) = mn. Thus 

. ^r^, m — k 

LSD $ >p> — ^ -n. 

□ 
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